Modified Recombinase 

The present invention concerns a fusion protein comprising a recombinase 
protein, preferably the site-specific DNA recombinase C31-Int of phage OC31, 
and a peptide sequence which directs the nuclear uptal<e of the fusion protein in 
eucaryotic cells, and the use of this fusion protein to recombine, invert or delete 
DNA molecules containing recognition sequences for said recombinase in 
eucaryotic cells at high efficiency. In addition the invention relates to a cell, 
preferably a mammalian cell which contains recognition sequences for said 
recombinase in its genome and wherein the genome is recombined by the action 
of said fusion protein. Moreover, the invention relates to the use of said cell to 
study the function of genes and for the creation of transgenic organisms to study 
gene function at various developmental stages, including the adult. In conclusion, 
the present invention provides a process which enables the highly efficient 
modification of the genome of mammalian cells by site-specific recombination. 
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Background of the invention 

The controlled and permanent modification of the genome of eucaryotic cells and 
organisms is an important method for research applications, e.g. for studying 
gene function, for medical applications like gene therapy and the creation of 
disease models and for the design of economically important animals and crops. 
The basic methods for genome manipulations by the engineering of endogenous 
genes through gene targeting in murine embryonic stem (ES) cells are well 
established and used since many years (Capecchi, Trends in Genetics, 5, 70-76 
(1989)). Since ES cells can pass mutations induced in vitro to transgenic 
offspring "in vivo it is possible to analyse the consequences of gene disruption in 
the context of the entire organism. Thus, nunnerous mouse strains with 
functionally inactivated genes ("knock-out mice") have been created by this 
technology and utilised to study the biological function of a variety of genes 
(Koller et al., Ann. Rev. Immunol., 10, 705 - 730 (1992)). More importantly, 
mouse mutants created by this procedure (also known as "conventional, 
complete or classical mutants"), contain the inactivated gene in all cells and 



tissues throughout life. Thus, classical mouse mutants represent the best animal 
model for inherited human diseases as the mutation is introduced into the 
germline but are not the optimal model to study gene function in adults, e.g. to 
validate potential drug target genes. 
5 A refined method of targeted mutagenesis, referred to as conditional 
mutagenesis, employs the Cre/loxP site-specific recombination system which 
enables the temporally and/or spatially restricted inactlvation of target genes in 
cells or mice (Rajewsky et al., J. Clin. Invest., 98, 600 - 603 (1996)). The phage 
PI derived Cre recomblnase recognises a 34 bp sequence referred to as loxP site 
10 which is structured as an inverted repeat of 13 bp separated by an asymmetric 8 
j bp sequence which defines the direction of the loxP site. If two loxP sites are 

1 

located on a DNA molecule in the same orientation the intervening DNA sequence 
' is excised by Cre recomblnase from the parental molecule as a closed circle 
j leaving one loxP site on each of the reaction products (Kilby et al., TIG, 9, 413- 
)' 421 (1993)). The creation of conditional mouse mutants initially requires the 
generation of two mouse strains, one containing two or more Cre recombinase 
recognition (!oxP) sites in its genome while the other harbours a Cre transgene. 
' The former strain is generated by homologous recombination in ES cells as 
described above, except that the exon(s) of the target gene is (are) flanked by 

20 two loxP sites which reside in introns and do not interfere with gene expression. 
The Cre transgenic strain contains a transgene whose expression is either 
constitutively active in certain cells and tissues or is inducible by external agents, 
depending on the promoter region used. Crossing of the loxP-flanked mouse 
strain with the Cre recombinase expressing strain enables the deletion of the 

25 loxP-flanked exons in the genome of doubly transgenic offspring in a prespecified 
temporally and/or spatially restricted manner. Thus, the method allows the 
analysis of gene function in particular cell types and tissues of otherwise widely 
expressed genes. I^oreover, it enables the analysis of gene function in the adult 
organism by circumventing embryonic lethality which is often the consequence of 

30 complete (germline) gene Inactlvation. For pharmaceutical research, aiming to 
validate the utility of genes and their products for drug development, gene 
inactlvation which is inducible in adults provides an excellent genetic tool as this 
mimicks the biological effects of target inhibition upon drug application. 
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Since the first description of the concept of conditional gene targeting using the 
Cre/loxP system in mice in 1994 (Gu et al.. Science 265, 103-106 (1994)) this 
method became increasingly popular among the research community and 
resulted in a broad collection of genetic tools for biological research in the mouse. 
More than 30 Cre transgenic mouse strains with various tissue specificities for 
gene inactivation have been created. Including several "deleter" strains which 
allow to remove the loxP-flanked target gene segment in the male or female 
germllne (Cohen-Tannoudji et al., Mol. Hum. Reprod. 4, 929-938 (1998); 
Metzger et a!., Curr. Op. Biotech., 10, 470-476 (1999)). The need to characterise 
the expression pattern of Cre mediated recombination in newly generated strains 
stimulated the construction of a number of "Cre-reporter" strains which harbour 
a silent reporter gene the expression of which is activated upon Cre-mediated 
deletion (Nagy, Genesis, 26, 99-109 (2000)). Conditional mouse mutants have 
been reported for about 20 different genes, many of them could not be studied in 
adults as their complete inactivation leads to embryonic lethality (Cohen- 
Tannoudji et al., Mol. Hum. Reprod. 4, 929-938 (1998)). 

Great efforts have also been made to control the expression of Cre recomblnase 
in an inducible fashion in mice. After the first demonstration that inducible gene 
knock-out is feasible In adult mice using an interferon controlled promoter (Kuhn 
et al.. Science, 269, 1427-1429 (1995)), mainly two methods were applied to 
control the activity of Cre recomblnase. First, it has been demonstrated that the 
fusion of Cre with the ligand binding domain of a mutant estrogen receptor allows 
to control recomblnase activity by a specific steroid-like inducer. Several 
transgenic mouse strains expressing such a fusion protein have been generated 
and allow to induce gene inactivation in specific tissues (Metzger et al., Curr. Op. 
Biotech., 10, 470-476 (1999)). Furthermore, the tetracycline-regulated gene 
expression system has been successfully used to control the expression of Cre in 
transgenic mice and thus provides a second system for inducible gene 
inactivation using doxycycline as inducer (Saam et al., J. Biol. Chem. 274, 
38071-38082 (1999)). 

In addition to the application of Cre/loxP for gene inactivation by deletion of a 
gene segment this recombination system has been proved to be useful also for a 
number of other genomic manipulations in ES cells or mice. These include the 
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conditional activation of transgenes in mice, chromosome engineering to obtain 
deletion, translocation or inversion, the simple removal of selection marker 
genes, gene replacement, the targeted insertion of transgenes and the 
(in)activation of genes by inversion (Nagy, Genesis, 26, 99-109 (2000); Cohen- 
5 Tannoudji et al., Mol. Hum. Reprod. 4, 929-938 (1998)). In conclusion, the 
Cre/loxP recombination system has been proven to be extremely useful for the 
analysis of gene function in mice by broadening the methodological spectrum for 
genome engineering. It can be expected that many of the protocols now 
established for the mouse may be applied in future also to other animals or 
10 plants. 



In contrast to the huge diversity of genome manipulations which have been 
developed for the Cre/loxP system, very limited efforts have been made to 
' develop further site-specific recombination systems for the use in mammalian 
> cells. Alternative recombination systems of different specificity but with an 
efficiency comparable to Cre/loxP could further enhance the flexibility of genome 
engineering by the side to side use of two or more systems in the same cell or 



i organism. Furthermore, unidirectional recombination systems which follow a 
j different mechanism than the reversible Cre/loxP-mediated recombination should 
20 allow to develop new applications for genome engineering which cannot be 
performed with the current systems. 



recombination in mammalian cells are readily explained by a number of 
25 requirements which must be fulfilled for the efficient use of a recombinase in 
mammalian cells: 



proteins, 

ii) It should act independent of the supercoiling status of the target DNA 



iii) it should be efficiently active and stable at a temperature of 37°C, and 

iv) it should recognize a target sequence which is sufficiently long to be 
unique among large genomes, and it should exhibit a very high affinity 
to its target site for efficient action (Kilby et al., TIG, 9, 413-421 



The reasons for the almost exclusive use of the Cre/loxP system for site-specific 



the recombinase should act independent of cofactors like helper 
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and also on mammalian chromatin. 
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(1993)). 



Among the more than 200 described mennbers of the integrase and 
resolvase/invertase recombinase families only the Cre/loxP system is presently 
known to fulfill all of these requirements (Nunes-Duby et al.. Nucleic Acids Res., 
26, 391-406 (1998); Kilby et al., TIG, 9, 413-421 (1993); Ringrose et al., J. Mol. 
Biol., 284, 363 - 384 (1998)). Besides Cre/loxP a few recombinases have been 
shown to exhibit some activity in mammalian ceils but their practical value is 
presently unclear as their efficieny has not been compared to the Cre/loxP 
system on the same genomic recombination substrate and in some cases it is 
known that one or more of the criteria listed above are not met. The best 
characterised examples are the yeast derived FLP and Kw recombinases which 
exhibit a temperature optimum at 30°C but which are unstable at 37°C (Buchholz 
et al.. Nature Biotech., 16, 657 - 662 (1998); Ringrose et al., Eur. J. Biochem., 
248, 903 - 912). For FLP it has been shown in addition that its affinity to the FRT 
target site is much lower as compared to the affinity of Cre to loxP sites 
(Ringrose et al., J. Mol. Biol., 284, 363 - 384 (1998)). Other recombinases which 
show in principle some activity in mammalian cells are a mutant integrase of 
phage X, the integrases of phages OC31 and HK022, mutant y5-resolvase and p- 
recombinase (Lorbach et al., J. Mol. Biol., 296, 1175 - 81 (2000); Groth et al., 
Proc. Natl. Acad. Sci. USA, 97, 5995 - 6000 (2000); Kolot et a!., Mol. Biol. Rep. 
26, 207 - 213 (1999); Schwikardi et al., FEBS Lett., 471, 147 - 150 (2000); Diaz 
et a!., J. Biol. Chem., 274, 6634 - 6640 (1999)). Other phage Integrase systems 
include coliphage P4 recombinase. Listeria phage recombinase, bacteriophage R4 
Sre recombinase, CisA recombinase, XisF recombinase and transposon Tn445i 
TnpX recombinase (Stark et al. Trends In Genetics 8, 432-439 (1992); Hatfull & 
Gridley, in Genetic Recombination. Eds. Kucherlipati & Smith, Am. Soc. 
Microbiol., Washington DC, 357-396 (1988)). 

However, the practical value of these recombinases and integrases for use in 
mammalian cells Is limited as their efficiency to recombine mammalian genomic 
DNA has not been tested or compared with the Cre/loxP system. From the data 
available it can be assumed that these recombinases are much less effective than 
the Cre/loxP system. 




In a few cases attempts have been nnade to improve the performance of 
recombinases in mammalian cells: for FLP a mutant showing improved 
thermostability and acticity at 37°C has been isolated but this mutant Is still 
considerably more heat labile as compared to Cre (Buchholz et a!.. Nature 
5 Biotech., 16, 657 - 662 (1998)). In the case of ?.-integrase and y5-resolvase the 
absolute requirement for coproteins and supercoiled DNA could be eliminated by 
the introduction of specific point mutations (Schwikardi et al. FEBS Lett 471, 
ppl47-50 (2000)). 

;0 The import of cytoplasmic proteins into the nucleus of eucaryotic cells through 
: nuclear pores is a regulated, energy dependent process mediated by specific 
: receptors (Gorlich et al.. Science, 271, 1513 - 1518 (1996)). Proteins which do 

not posses a signal sequence recognised by the nuclear import machinery are 
■ excluded from the nucleus and remain In the cytoplasm. Numerous of such 
;5 nuclear localisation signal sequences (NLS), which share a high proportion of 

basic amino acids in common, have been characterised (Boullkas, Crit. Rev. 

Eucar. Gene Expression, 3, 193 - 227 (1993)), the prototype of which is the 7 

amino acid NLS derived from the T-antlgen of the S\/40 virus (Kalderon et/al. 

Cell, 39, 499 - 509 (1984)). 

20 

It was believed that the fusion of such an NLS peptide to a recomblnase possibly 
. would enhance the efficiency of the recomblnase by mediating its import into the 
nucleus and therewith Increasing the concentration of the recomblnase inside the 
nucleus. However, for Cre recomblnase It has been shown that the addition of 

25 the SV-40 T-antlgen NLS does not improve Its recombination efficiency in 
mammalian cells (Le et al.. Nucleic Acid Res., 27, 4703 -4709 (1999)). 
Nevertheless, both Cre and a Cre-NLS-fusion protein are widely used. Schwikai-di 
(Schwikardi et al., FEBS Lett. 471, ppl47-50 (2000)) reported a y5-resolvase-SV- 
40 T-antigen NLS fusion protein, which also did not enhance the recombination 

30 efficiency. 

The level of activity exhibited by recombinases of diverse prokaryotlc origin in 
mammalian cells may be the result of the intrinsic properties of an enzyme 
depending on parameters like its temperature optimum. Its target site affinity, 
35 protein structure and stability, the degree of cooperativity, the stability of the 




synaptic complex and the dependence on coproteins or supercoiled DNA. Within 
the specific environment of mammalian cells the activity of a prokaryotic 
recombinase could be limited by additional factors such as a short half-life of the 
recombinase transcript, a short half-life of its protein, its inability to act on 
5 histone-complexed and higher order structured mammalian genomic DNA, 
exclusion from the nucleus or the recognition of cryptic splice sites within its 
mRNA resulting in a nonfunctional transcript. Due to the lack of information on 
the parameters listed above for almost all recombinases it is presently not 
possible to rationally optimise their performance in mammalian cells. 

10 

Summary of the Invention 

\ The object to be solved by the invention of the present application is the 
: provision of a recombination system alternative to the Cre/loxP system, which 

15 has a different specificity but an efficiency comparable to Cre/loxP. Such an 

alternative recombination system is particularly desirable for all those 
i applications which require more than one potent recombination system for being 

successfully carried out (e.g. the methods disclosed in PCT/EPO 1/00060 and 

PCT/EPOO/10162). Most surprisingly, it was found that the above object can be 
20 solved by fusing a signal peptide capable directing the nuclear import 

(hereinafter shortly referred to as nuclear localisation signal sequences (NLS)) to 

specific recombinases. 

In contrast to the wlldtype recombinases, the resulting modified recombinases 
25 allow a highly efficient recombination of extrachromosomal and chromosomal 
DNA In mammalian cells, and a highly efficient excision of extrachromosomal and 
chromosomal DNA-stretches, which are flanked by suitable recognition sites for 
said modified recombinases. 

30 The present invention thus provides: 

(1) A fusion protein (hereinafter also referred to as "modified recombinase") 
comprising 

(a) a recombinase domain comprising a recombinase protein or fragment thereof 
and 

35 (b) a signal peptide domain being linked to (a) and directing the nuclear import 
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of said fusion protein in eucaryotic cells, 

preferably the activity of the fusion protein in eucaryotic cells is significantly 
higher as compared to the acitivity of the wildtype recombinase corresponding 
to the recombinase of the recombinase domain; 

(2) in a preferred embodiment of the fusion protein defined in (1) above, the 
recombinase domain comprises an integrase protein, preferably a phage OC31 
integrase (C31-Int) protein or a mutant thereof; 

(3) a DNA coding for the fusion protein as defined in (1) or (2) above; 

(4) a vector containing the DNA as defined in (3) above; 

(5) a microorganism containing the DNA of (3) above and/or the vector of (4) 

above; 

(6) a process for preparing the fusion protein as defined in (1) or (2) above 
which comprises culturing a microorganism as defined in (5) above; 

(7) the use of the fusion protein as defined In (1) or (2) above to recomblne DNA 
molecules, which contain recombinase recognition sequences for the 
recombinase protein of the recombinase domain. In eucaryotic cells; 

(8) a cell, preferably a mammalian cell containing the DNA sequence of (3) above 
in its genome; 

(9) the use of the cell of (8) above for studying the function of genes and for the 
creation of transgenic organisms; 

(10) a transgenic organism, preferably a transgenic mammal containing the DNA 
sequence of (3) above In its genome; \. 

(11) the use of the transgenic organism of (10) above for studying gene function 
at various developmental stages; and 

(12) a method for recombining DNA molecules of cells or organisms containing 
recognition sequences for the recombinase protein of the recombinase domain as 
defined in (1) or (2) above, which method comprises supplying the cells or 
organisms with a fusion protein as defined in (1) or (2) above, or with a DNA 
sequence of (3) above and/or a vector of (4) above which are capable of 
expressing said fusion protein in the cell or organism. 

The present invention combines the use of proI<aryotic recombinases such as the 
C31-Int with a eukaryotic signal sequence which increases its efficiency in 
mammalian cells such that it is equal to the widely used Cre/loxP recombination 
system. The improved recombination system of the present invention provides 



an alternative recombination system for use In mammalian cells and organisms 
which allows to perform the same types of genomic modifications as shown for 
Cre/loxP, Including conditional gene Inactivation by recombinase-mediated 
deletion, the conditional activation of transgenes in mice, chromosome 
engineering to obtain deletion, translocation or inversion, the simple removal of 
selection marker genes, gene replacement, the targeted insertion of transgenes 
and the (in)actlvatlon of genes by Inversion. 

Short Description of Figures 

^'9- 1- C31-Int and Cre recomblnase expression vectors and a recomblnase 
reporter vector used for transient and stable transfectlons 

Fig- 2: Results of transient transfectlons of C31 Int and Cre expression vectors 
and reporter vectors into CHO cells. 

Fig- 3: Results of transient transfectlons of XlsA and Ssv recomblnase 
expression vectors with and without nuclear localisation signals and reporter 
vectors Into CHO ceils. 

Fig- 4: Results of transient transfectlons of C31-Int and Cre recomblnase vectors 
Into a stable reporter cell line. 

Fig- 5: In situ detection of B-galactosidase In 3T3(pRK64)-3 cells transfected with 
recomblnase expression vectors 

Fig. 6: Test vector for C31-Int mediated deletion, pRK64, and the expected 
deletion product. 

I^'g- 7: PCR products generated with the primers P64-1 and P64-4 and sequence 
comparison. 

Fig^ ROSA26 locus of the C31 reporter mice carrying a C31 reporter construct. 

Elg^_9i In situ detection of p-galactosidase In a cryosection of the testis of: (A) a 
double transgenic mouse carrying both the recomblnase and the reporter; and 



(B) a transgenic mouse carrying only the reporter as a control. 
Detailed Description of the Invention 

The "organisms" according to the present invention are multi-cell organisms and 
5 can be vertebrates such as mammals (humans and non-human animals including 
rodents such as mice or rats) or non-mammals (e.g. fish), or can be 
invertebrates such as insects or worms, or can be plants (higher plants, ajgl or 
fungi). Most preferred living organisms are mice and fish. 

10 "Cells" and "eucaryotic cells" according to the present invention include cells 
isolated from the above defined living organism and cultured in vitro. These cells 
can be transformed (immortalized) or untransformed (directly derived from the 

: living organism; primary cell culture). 

[5 "Microorganism" according to the present invention relates to procaryotes (e.g. 
E. coli) and eucaryotic microorganisms (e.g. yeasts). 

According to embodiment (1) of the present invention/ the activity of the fusion 
i protein in eucaryotic cells is significantly higher as compared to the acitivity of 
iO the wildtype recombinase corresponding to the recombinase of the recombinase 
domain. A "significantly higher activity" in accordance with the present invention 
refers to an increase in activity of at least 50%, preferably at least 75%, more 
preferably at least 100%. relative to the corresponding wildtyp recombinase in 
eucaryotic cells. A "significantly higher activty" also implies that the resulting 
25 fusion protein has at least 25%, preferably at least 50% and more preferably at 
least 75%, of the activity of Cre/loxP in 3T3 cells with a stably integrated target 
sequence. 

Recombinase proteins which can be used in the recombinase domain of the 
30 fusion protein of the present invention (i.e., giving a fusion having a "significantly 
higher activty" as defined above) include , but are not limited to, a certain type 
of recombinases belonging to the family of of large serine recombinases (Thorpe 
et al., Control of directionalty in the site-specific recombination system of the 
streptomyces phage <|)C31, Molecular Microbiology 38(2), 232-241 (2000)). This 
35 family includes bacteriophage <1»C31 integrase ("C31-Int"; the amino acid 
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sequence of said integrase and a DNA sequence coding therefor are shown in 
SEQ ID NOs:21 and 20, respectively), coliphage P4 recombinase, Listeria phage 
recombinase, bacteriophage R4 Sre recombinase ("R4 Sre" deposited under GI 
793758; the amino acid sequence of said recombinase and a DNA sequence 
coding therefor are shown in SEQ ID NOs:55 and 54, respectively), bacillus 
subtilis CisA recombinase ("CisA" deposited under GI 142689; the amino acid 
sequence of said recombinase and a DNA sequence coding therefor are shown in 
SEQ ID NOs:57 and 56, respectively), XisF recombinase from annabaena sp. 
Strain PCC 7120 (Cyanobacterium; "XisF" deposited under GI 349678; the amino 
acid sequence of said integrase and a DNA sequence coding therefor are shown 
in SEQ ID NOs:59 and 58, respectively), transposon Tn4451 TnpX recombinase 
("TnpX" deposited under GI 551135; the amino acid sequence of said 
recombinase and a DNA sequence coding therefor are shown in SEQ ID N0s:61 
and 60, respectively), "XisA" recombinase from annabaena sp. Strain PCC 7120 
(Cyanobacterium; the amino acid sequence of said recombinase and a DNA 
sequence coding therefor are shown in SEQ ID NOs:63 and 62, respectively), 
"SSV" recombinase from phage of sulfolobus shibatae (the amino acid sequence 
of said recombinase and a DNA sequence coding therefor are shown In SEQ ID 
NOs:65 and 64, respectively), lactococcal bacteriophage TP901-1 recombinase 
(TP901-1 complete genome deposited under GI 13786531; the amino acid 
sequence of said recombinase and a DNA sequence coding therefor are shown in 
SEQ ID NOs:108 and 107, respectively), and the like, or mutants thereof. Other 
procaryotic recombinases known in the art are also applicable. 

A "mutant" of the above recombinases in accordance with the present invention 
reliates to a mutant of the respective original (viz, wild-type) recombinase having 
a recombinase activity similar (e.g. at least about 90%) to that of said wild-type 
recombinase. Mutants include truncated forms of the recombinase (such as N- or 
C-terminal truncated recombinase proteins), deletion-type mutants (where one 
or more amino acid residues or segments having more than one continuous 
amino acid residue have been deleted from the primary sequence of the wildtyp 
recombinase), replacement-type mutants (where one or more amino acid 
residues or segments of the primary sequence of the wildtyp recombinase have 
been replaced with alternative amino acid residues or segments), or 
combinations thereof. 



According to embodiment (2) of the invention, tine recombinase domain 
comprises an Integrase protein, preferably a phage <1>C31 integrase (C31-Int) 
protein or a mutant thereof. Thus, the present invention provides a fusion protein 
5 comprising 

(a) an integrase domain being a C31-Int protein or a mutant thereof, and 

(b) a signal peptide domain being linked to (a) and directing the nuclear import 
of said fusion protein into eucaryotic cells. 

'lO In the fusion protein of embodiment (2), the integrase domain is preferably a 
C31-Int having the amino acid sequence shown in SEQ ID NO:21 or a C-terminal 
truncated form thereof. Suitable truncated forms of the C31-Int comprise amino 
acid residues 306 to 613 of SEQ ID NO:21. 

(5 The signal peptide domain (hereinafter also referred to as "NLS") is preferably 
derived from yeast GAL4, SKI3, L29 or histone H2B proteins, polyoma virus large 

I T protein, VPl or VP2 capsid protein, SV40 VPl or VP2 capsid protein, Adenovirus 
Ela or DBP protein, influenza virus NSl protein, hepatitis virus core antigen or 

, the mammalian lamin, c-myc, max, c-myb, p53, c-erbA, jun. Tax, steroid 

i 

20 receptor or Mx proteins (see Boulikas, Crit. Rev. Eucar. Gene Expression, 3, 193 

- 227 (1993)), simian virus 40 ("SV40") T-antigen (Kalderon et. al. Cell, 39, 499 

- 509 (1984)) or other proteins with known nuclear localisatlpn: v;:Jhe NLS is 
preferably derived from the S\/40 T-antigen. 

25 Furthermore, the signal peptide domain preferably has a length of 5 to 74, 
preferably 7 to 15 amino acid residues. More preferred is that the signal peptide 
domain comprises a segment of 6 amino acid residues wherein at least 2 amino 
acid residues, preferably at least 3 amino acid residues are positively charged 
basic amino acids. Basic amino acids include, but are not limited to, Lysin, Arginin 

30 and Histidine. Particularly preferred signal peptides are show in the following 
table. 
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Organism 
yeast GAL4 
yeast SKI3 



Sequence/(SEQ ID NO:) 
MKxllCRLKKLKCSKEKPKCAKCLKx5Rx3KTKR (24) 
IKYFKKFPKD (25) 
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yeast L29 


MTGSKTRKHRGSGA 


(26) 






(MTGSKHRKHPGSGA) 


(27) 




yeast histone H2B 


(GKKRSKA) 


(28) 




polyoma virus large T protein 


(PKKAREDVSRKRPR) 


(29) 


5 


polyoma virus VPl capsid protein 


(APKRKSGVSKC) 


(30) 




polyoma virus VP2 capsid protein 


(EEDGPQKKKRRL) 


(31) 




SV40 VPl capsid protein 


(APTKRKGS) 


(32) 




SV40 VP2 capsid protein. 


(PNKKKRK) 


(33) 




Adenovirus Ela protein 


(KRPRP) 


(34) 


10 




(CGGLSSKRPRP) 


- (35) 




Adenovirus DBP protein 


(PPKKRMRRRIEPKKKKKRP) 


(36) 




influenza virus NSl protein 


(PFLDRLRRDQK) 


(37) 






(PKQKRKMAR) 


(38) 




human laminA 


(SVTKKRKLE) 


(39) 


1.5 


human c-myc 


(CGGAAKRVKLD) 


(40) 






(PAAKRVKLD) 


(41) 


1 




(RQRRNELKRSP) 


(42) 


i 


HUMAN max 


(PQSRKKLR) 


(43) 




HUlviAN c-myb 


(PLLKKIKQ) 


(44) 


20 


HUMAN p53 


(PQPKKKP) 


(45) 




HUMAN c-erbA 


(SKRVAKRKL) 


(46) 




VIRAL jun 


(ASKSRKRKL) 


(47) 




HUMAN Tax 


(GGLCSARLHRHALLAT) 


(48) 




Mammalian glucocorticoid receptor 


(RKTKKKIK) 


(49) 


25 


HUMAN ANDROGEN RECEPTOR 


(RKLKKLGN) 


(50) 




MAMMALIAN ESTROGEN RECEPTOR 


(RKDRRGGR) 


(51) 




Mx proteins 


(DTREKKKFLKRRLLRLDE) 


(52) 




SV40 T-antigen 


(PKKKRKV) 


(53) 



30 The most preferred signal peptide domain is that of SV40 T-antigen having the 
sequence Pro-Lys-Lys-Lys-Arg-Lys-Val. 

The signal peptide domain may be linked to the N-terminal or C-terminal of the 
integrase domain or may be integrated into the integrase domain, preferably the 
35 signal peptide domain is linked to the C-terminal of the integrase domain. With 



regard to phage 0031 integrase protein of embodiment (2) of the invention It 
was found that the fusion of an NLS-peptide to the C-terminus of the integrase 
provided a much higher increase of activity as compared to the fusion of the 
same IMLS-peptide to the N-terminus of the integrase (see Example 1, figures 3 
5 and 4). 

According to the present invention, the signal peptide domain may be linked to 
the Integrase domain directly or through a linker peptide. Suitable linkers include 
peptides having from 1 to 30, preferably 1 to 15 amino acid residues, said amino 
10 acid residues being essentially neutral amino acids such as Gly, Ala and Val. 

I The most preferred fusion protein of the present invention comprises the amino 
I acid sequence shown in SEQ ID NO: 23 (a suitable DNA sequence coding for said 
fusion protein being shown in SEQ ID NO:22). 

15 

j Further preferred fusion proteins of the present invention are "NLS-XisA" and 
j "NLS-SSV" (having the NLS-peptide fused to the N-terminus of the 
recombinases) as shown In SEQ ID NO:67 and 69, respectively (suitable DNA 
sequences coding for said fusion proteins being shown in SEQ ID NO:66 and 68, 
20 respectively). 

In embodiments (7), (8), (10) and (12) of the invention the DNA molecules, the 
cell or transgenic organism may also contain recognition sequences for the 
recombinase protein of the recombinase domain. Thus, when utilizing the fusion 
25 protein of embodiment (2), the C31-Int recognition sequences attP and attB are 
present in DNA molecules, the cell or transgenic organism. 

The term "mammal" as used in embodiment (10) of the invention Includes non- 
human mammals (viz. animals as defined above) and humans (If such subject 
30 matter is patentable with the respective patent authority). 

Since the modified recombinase of the invention, in particular the modified C31- 
Int, acts In mammalian cells as efficient (or at least almost as efficient) as the 
widely used Cre/loxP system it can be used for a large variety of genomic 
35 modifications (Including the methods disclosed In PCT/EPO 1/00060 and 
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PCT/EPOO/10162, the content of which is herewith incorporated by reference). 
Concerning embodiment (11) it is to be noted that the mammals of embodiment 
(10) can be used to study the function of genes, e.g. in mice, by conditional gene 
targeting. For this purpose suitable recognition sequences - when utilizing the 
fusion protein of embodiment (2), one attP and one attB site (C31-Int recognition 
sequences) in the same orientation - can be introduced into introns of a gene by 
homologous recombination of a gene targeting vector in ES cells such that the 
two sites flank one or more exons of the gene to be studied but do not interfere 
with gene expression. A selection marker gene, needed to isolate recombinant ES 
cell clones, can be flanked by two recognition sites of another recombinase such 
as loxP or FRT sites to enable deletion of the marker gene upon transient 
expression of the respective recombinase in ES cells. These ES cells can be used 
to generate germline chimaeric mice which transmit the target gene modified by 
att sites to their offspring and allow to establish a modified mouse strain. The 
crossing of this strain with a C31-Int recombinase transgenic line or the 
application of C31-Int protein will result in the deletion of the att-flanked gene 
segment from the genome of doubly transgenic offspring and the inactivation of 
the target gene in doubly transgenic offspring in a prespecified temporally and/or 
spatially restricted manner. The C31-Int transgenic strain contains a transgene 
whose expression is either constitutively active in certain cells and tissues or is 
inducible by external agents, depending on the promoter region used. If an attB 
and an attP site are placed into the genome in opposite orientation C31-Int 
mediated recombination results in the irreversible inversion of the flanked gene 
segment leading the functional loss of on or more exons of the target gene. 
Thus, the method allows the analysis of gene function in particular cell types and 
tissues of otherwise widely expressed genes and circumvents embryonic lethality 
which is often the consequence of complete (germline) gene inactivation. For the 
validation of genes and their products for drug development, gene inactivation 
which is inducible in adults provides an excellent genetic tool as this mimicks the 
biological effects of target inhibition upon drug application. If a pair of attS/P 
sites is placed in the same or opposite orientation into a chromosome at large 
distance using two gene targeting vectors, C31-Int recombination allows to 
delete or invert chromosome segments containing one or more genes, or 
chromosomal translocations if the two sites are located on different 
chromosomes. In another application of the method a pair of attB/P sites is 



placed in the same orientation within a transgene such that the deletion of the 
att-flanked DNA segment results in gene expression, e.g. of a toxin or reporter 
gene for cell lineage studies, or in the inactivation of the transgene. 

In addition, according with embodiment (12) of the invention, the recombination 
system of embodiment (1), in particular the C31-Int recombination system of 
embodiment (2), can also be used for the site specific integration of foreign DNA 
into the genome of mammalian cells, e.g. for gene therapy. For this purpose, and 
if the C31-Int recombination system of embodiment (2) is utilized, only one attB 
(or attP) site is initially introduced into the genome by homologous 
recombination, or an endogenous genomic sequence which resembles attB or 
attP is used . The application of a vector containing an attP (or attB) site to such 
cells or mice in conjunction with the expression of C31-Int recombinase will lead 
to the site specific integration of the vector into the genomic att site. 

Thus, the present invention provides a process which enables the highly efficient 
modification of the genome of mammalian cells by site-specific recombination. 
Said process possesses the following advantages over current technology: 

(i) the modified recombinase, in particular the modified C31-Integrase, allows 
to recombine extrachromosomal and genomic DNA in mammalian cells at 
much higher efficiency as compared to the use of its wildtype form; 

(ii) the modified recombinase, in particular the modified C31-Integrase, is the 
first described alternative recombination system with equal efficiency to 
Cre/loxP for the recombination of chromosomal DNA in mammalian cells. 

The appended figures further explain the present invention: 

Figure 1 shows C31-Int and Cre recombinase expression vectors and a 
recombinase reporter vector used for transient and stable transfections. 
A-D: Mammalian expression vectors for recombinases which contain the CMV 
immediate early promoter followed by a hybrid intron, the coding region of the 
recombinase to be tested, and an artificial polyadenylation signal sequence (pA). 
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A: pCMV-C31Int(wt) containing the nonmodified (wildtype) 1.85 kb coding region 
of C31-Int as found in the genome of phage cdx31. 

B: pCMV-C31Int(NNLS) containing a modified C31-Int gene coding for the full 
length C31-Int protein with a N-terminal fusion to the SV40 virus large T antigen 
5 nuclear localisation signal (NLS). 

C: pCMV-C31Int(CNLS) containing a modified C31-Int gene coding for the full 
length C31-Int protein with a C-terrninal fusion to the SV40 virus large T antigen 
nuclear localisation signal (NLS). 

D: pCMV-Cre contains the 1.1 kb Cre coding region with an N-terminal fusion to 
]0 the SV40 T antigen NLS. 

E: Recombination substrate vector pRK64 contains a SV40 promoter region 

followed by a 1.1 kb cassette consisting of the coding region of the puromycin 
i resistance gene and a polyadenylation signal sequence, flanked 5' by the 84 bp 

attB and 3 ' by the 84 bp attP recognition site of C31-Int. pRK64 contains in 
,15 addition two Cre recognition (loxP) sites in direct orientation next to the att sites. 

Figure 2 shows results of transient transfections of C31-Int and Cre recombinase 
and reporter vectors into CHO cells. 
; All transfections were performed with a fixed amount of the reporter plasmid 
20 pRK64 and 0.5 ng or 1 ng of the recombinase expression plasmids pCMV-C31- 
Int(wt) (samples 4-5), pCMV-C31-Int(NNLS) (samples 6-7), pCMV-C31- 
Int(CNLS) (samples 8-9) or pCMV-Cre (samples 10-11). Negative controls: 
transfection with pRK64 (sample 3) or pUC19 alone (sample 1). Positive control: 
transfection with the Cre-recombined reporter pRK64(ACre) (sample 2). 
25 The vertical rows show the mean values and standard deviation of "Relative Light 
Units" obtained from lysates with the assay for B-galactosidase (RLU (6-Gal)), 
the RLU from the assay for Luciferase, the ratio of the 6-galactosidase and 
Luciferase values with standard deviation (RLU x 10^ (Gal/Luc)), and the relative 
activity of the various recombinases as compared to the positive control defined 



Figure 3 shows results of transient transfections of XisA and Ssv recombinases 
and reporter vectors into CHO cells. 

All transfections were performed with fixed amounts of the reporter plasmids 
35 pPGKnif (for XisA) and pPGKattA (for SSV) and 25 ng or 100 ng of the 



30 



as 1. 
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recombinase expression plasmids pCMV-XisA, pCMV-XisA(NNLS) and 10 ng or 20 
ng of the expression plasmids pC!V|V-Ssv and pCI^V-Ssv(NNLS). Negative 
controls: transfection with pPGKnif or pPGKattA alone. 

The vertical rows show the mean values and standard deviation of "Relative Light 
Units" obtained from lysates with the assay for B-galactosidase (RLU (6-Gal)), 
the RLU from the assay for Luciferase, the ratio of the 6-galactosidase and 
"Luclferase" values with standard deviation (RLU x 10^ (Gal/Luc)). 

Figure 4 shows results of transient transfections of recombinase vectors into a 
stable reporter cell line. 

All transfections were performed with a NIH 3T3 derived clone containing stably 
integrated; copies of the pRK64 recombination substrate vector. Either 32 ng or 
64 ng of the recombinase expression plasmids pCMV-C31-Int(wt) (samples 2-3), 
pCMV-C31-Int(NNLS) (samples 4-5), pCMV-C31-Int(CNLS) (samples 6-7) or 
pCMV-Cre(NNLS) (samples 8-9). Negative control: transfection with pUC19 alone 
(sample 1). 

The vertical rows show the mean values and standard deviation of "Relative Light 
Units" obtained from lysates with the assky for 6-galactosidase (RLU (6-Gal)) and 
the relative activity of the various recombinases as compared to the value 
obtained with pCMV-Cre(NNLS) defined as 1. 

Figure 5 shows the in situ detection of 6-galactosidase in 3T3(pRK64)-3 cells 
transfected with recombinase expression vectors. ' ^ 

The Cre and C31-Int recombinase reporter cell line 3T3(pRK64)-3 was either not 
transfected with DNA (A), transfected with the Cre expression vector pCMV-Cre 
(B) or with the C31-Int expression vector pCMV-C31-Int(CNLS). Two days after 
tranfection the cells were fixed and incubated with the histochemical X-Gal assay 
which develops a blue stain in B-galactosidase positive cells indicating 
recombinase mediated activation of the reporter gene. 

'^' gu''g 6 shows the test vector for C31-Int mediated deletion, pRK64, and the 
expected product of deletion, pRK64(AInt). 

Plasmid pRK64 contains the 1,1 kb cassette of the coding region of the 
puromycin resistance gene and a polyadenylation signal, which is flanked 5' by 
the 84 bp attB and 3 ' by the 84 bp attP recognition site (large triangles) of C31- 
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Int. These attB and attP sites are oriented in the same way to each other (thicl< 
blacl< arrows) which is used by the OXSl phage to integrate into the bacterial 
genome. In addition, the cassette is flanl<ed by two Cre recombinase recognition 
(loxP) sites in the same orientation (black small triangles). For better orientation 

5 the half sites of the att sequences are labelled by a direction (thin arrow) and 
numbered 1-4. The 3 bp sequence within the att sites at which recombination 
occurs is framed by a box. The positions at which the PGR primers P64-1 and 
P64-4 hybridise to the pRK64 vector are indicated by arrows, pointing into the 
3' direction of both oligonucleotides. 
10 PRK64(AInt) depicts the deletion product expected from the C31-Int mediated 
recombination between the att sites of pRK64. The recombination between a pair 
of attB/attP sites generates an attR site remaining on the parental DNA molecule 
while the puromycin cassette is excised. In this configuration the primers P64-1 

' and P64-4 will amplify a PGR product of 630 tap from pRK64(AInt). 



Figure 7 shows PGR products generated with the primers P64-1 and P64-4 and a 
sequence comparison of the PGR product. 



I A: Analysis of PGR products on an agarose gel from PGR reactions using the 
I Primers P64-1 and P64-4 on DNA extracted from MEF5-5 cells transfected 2 days 
•> before with plasmid pRK64 alone (lane 4), with pRK64 + GMV-Cre (lane 3), with 
pRK54 + pGIMV-G31-Int(wt) (lane 2), and from a control reaction which did not 
contain cellular DNA (lane 1). The product with an apparent size around 650. bp, 
as compared to the size marker used, from lane 2 was excised from the agarose 
gel and purified. The PGR product was cloned into a sequencing plasmid vector 
25 and gave rise to the plasmid pRKSOd. The insert of this plasmid was sequenced 
using reverse primer (seqSOd) and compared to the predicted sequence of the 
pRK64 vector after C31-Int mediated deletion of the att flanked cassette, 
pRK64(Alnt). The cloned PGR product shows a 100% identity with the predicted 
attR sequence after deletion. The generated attR site is shown in a box, with the 
30 same sequence designation used in Figure 5. The nucleotide positions (pos.) of 
the compared sequences pRK64(AInt) and SeqSOd are indicated. 

Figure 8 shows the modified ROSA26 locus of G31 reporter mice (Seq ID 
NO: 106). A recombination substrate has been inserted in the ROSA26 locus. The 
35 substate consists of a splice acceptor (SA) followed by a cassette consisting of 
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the hygromycin resistance gene driven by a PGK promoter and flanked by the 
recombination sites attB and attP. In addition the reporter contains two Cre 
recognition sites (loxP) in direct orientation next to the att sites. This cassette is 
followed by the coding region for p-galactosidase, which is only expressed when 
5 the hygromycin resistance gene has been deleted by recombination. 

Figure 9 shows the in situ detection of p-galactosidase activity. A cryosection of 
the testis of a double transgenic mouse carrying both the C31-int recombinase 
and the recombination substrate was stained with X-Gal (A). The blue colour 

10 indicates recombination of the substrate, which leads to the expression of p- 
galactosidase. As a control a cryosection of testis of a transgenic mouse carrying 

j only the recombination substrate was stained with X-Gal (B). 

The present invention is further illustrated by the following Examples which are, 
5 however, not to be construed as to limit the invention. 

Examples 

Example 1 

"20 As compared to Cre recombinase the wildtype form of C31-Int exhibits a 
significantly lower recombination activity in mammalian cells which falls in the 
range of 10 - 40% of Cre, depending on the assay system used (see below). As 
a measure which may increase C31-Int efficiency in eukaryotic cells we designed 
mammalian expression vectors for N- or C-terminal fusion proteins of C31-Int 

25 with a peptide was designed which is recognised by the nuclear import 
machinery. The recombination efficiency obtained by this modified C31-Int 
recombinase in mammalian cells was comparied side by side to the unmodified 
(wildtype) form of C31-Int and to Cre recombinase. For the quantification of 
recombinase activities the expression vectors were transiently introduced into a 

30 mammalian cell line together with a reporter vector which contains C31-Int and 
Cre target sites and leads to the expression of 6-galactosidase upon recombinase 
mediated deletion of a vector segment flanked by recombinase recognition sites. 



A. Plasmid constructions: 
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Construction of the recombination test vectors pPGKnif and pPGKattA: first a nifD 
site (Haselkorn, Annu Rev.Genet. 26, 113-130 (1992)) generated by the 
annealing of the two synthetic oligonucleotides nifD3 (SEQ ID IMO:89) and nifD4 
(SEQ ID NO: 90), was ligated into the BamHI restriction site of the vector PSV- 
5 Paxl (Buchholz et al.. Nucleic Acids Res., 24, 4256-4262 (1996)), 3' of its 
puromycin resistance gene and loxP site, giving rise to plasmid pPGKnifDB' (SEQ 
ID NO:79). Next, another nifD site, generated by the annealing of the two 
synthetic oligonucleotides nifDl (SEQ ID NO:87) and nifD2 (SEQ ID NO:88), was 
ligated into the BstBI restriction site of plasmid pPGKnifD3', upstream of the 
10 puromycin resistance gene and loxP site, giving rise to plasmid pPGKnifD (SEQ ID 
! NO: 78). For pPGKattA (l^uskheiishvili et al., l^lol.Gen.Genet. 237, 334-342 
(1993)) first a 352bp-fragment was amplified from genomic DNA from the 
thermophilic bacterium Sulfolobus shibatae (DSI^-5389, DSMZ, Braunschwelg- 
Deutsche Sammlijng von Mikroorganismen und Zellkulturen GmbH, Mascheroder 
15 Weg lb, D-38124 Braunschweig, Germany) with oligonucleotides SSV5 (SEQ ID 
, NO:96) and SSV6 (SEQ ID NO:97) including restriction sites for BamHI and 
! BstBI. The amplified fragment was cloned into the BamHI site of the vector PSV- 
Paxl giving rise to plasmid pPGKattAl (SEQ ID NO:82), subsequently the same 
352 bp-fragment was cloned into the BstBI site of pPGKattAl giving rise to the 



20 plasmid pPGKattA2 (SEQ ID NO:83). The sequence and orientation of both nifD 
sites and attA sites was confirmed by DNA sequence analysis. In 
pPGKnifD/pPGKattA2 the newly cloned nifD/attA sites (positions 535-619 and 
1722-1787/ positions 6718-7081 and 12-363) are In the same orientation 
flanking the puromycin resistance gene and the SV40 early polyadenylation 

25 sequence. The nifD/attA sites are followed by loxP sites in the same orientation 
(positions 623 - 656 and 1794 - 1827/ positions 7085-7118 and 369-402). The 
puromycin cassette is transcribed from the SV40 early enhancer/promoter region 
and followed by the coding region for E. coli 13-galactosidase and the SV40 late 
region polyadenylation sequence. 



Construction of XisA and SSV expression vectors: First the XisA gene of 
cyanobacterium PCC7120 was amplified by PCR from genomic DNA from Nostoc 
strain PCC7120 (CNCM-Collection Nationale de Cultures de Microorganismes, 
Institut Pasteur, Paris) using the primers XisAl (SEQ ID NO: 84) and XisA3 (SEQ 
35 ID NO:86), and XisAl (SEQ ID NO;84) and XisA2 (SEQ ID NO:85) (with NLS), 
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The ends of the PGR product were digested with NotI and the product was ligated 
into plasmid pBluescript II KS, opened with NotI, giving rise to plasmids pRK42a 
and pRK43 (with NNLS). The DNA sequence of the insert was determined and 
found to be identical to the published XisA sequence (Genbank GI: 3953452) 
apart from four silent point mutations. The XisA gene was Isolated as a 1.4 kb 
fragment from pRK42a and pRK43 by digestion with NotI and ligated into the 
generic mammalian expression vector pRK50 (see below), opened with NotI, 
giving rise to the XisA expression vectors pCMV-XisA (SEQ ID NO:76) and pCMV- 
XlsA(NNLS) (SEQ ID NO:77). pCMV-XisA(wt) contains a Cytomegalovirus 
immediated early gene promoter (position 1 - 616), a 240 bp hybrid intron 
(position 716 - 953), the XisA gene (position 974 - 2392), and a synthetic 
polyadenylation sequence (position 2413 - 2591). 

The SSV gene was amplified from genomic DNA from the thermophilic bacterium 
Sulfolobus shibatae (DSM-5389, DSMZ Braunschweig- Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg lb, D-38124 
Braunschweig, Germany) in two PGR steps because of an internal attP sequence. 
First, two overlapping PGR fragments were created with the oligonucleotides 
SSVl-1 (SEQ ID N0:91) (or SSVl-2 for the SSV( NNLS) gene) and SSV2 (SEQ ID 
NO:93) and oligonucleotides SSV3 (SEQ ID NO:94) and SSV4 (SEQ ID NO:95). 
Using these overlapping fragments as template, a lOOObp fragment containing 
the complete SSV coding sequence was amplified with primers SSVl-1 (or SSVl- 
2 for the ;SSV(I^NLS) gene) and SSV4. The 5V 620 bp-fragments of these PGR 
products were isolated by digestion with Notl-Xhol and cloned Into vector 
pBluescript II KS giving rise to plasmids pRK47 and pRK48 (with NLS). The 3' 
380 bp fragment generated by Xhol-digestion was cloned into the Xhol 
restriction site of vector pBluescript II KS giving rise to the plasmid pBS-SSVs 
(SEQ ID NO:72). The 380bp SSV-fragment was then isolated by digestion of 
pBS-SSVs with Xhol and ligated into pRK47 and pRK48 opened by Xhol giving 
rise to plasmids pBS-SSV3 (SEQ ID NO: 70) and pBS-SSV4 (SEQ ID NO: 71) (with 
NLS) containing the complete SSV gene. Sequencing of the plasmids confirmed 
one point mutation in both plasmids. Therefore 312 bp/ 91 bp fragments 
generated by digestion with EcoRV-Smal/ EcoRV-XhoI of another clone of pRK47 . 
were exchanged in plasmids pBS-SSV3/ pBS-SSV4. Sequences were confirmed 
by sequencing. The SSV gene was isolated from pRK47 and pRK48 by digestion 
with NotI and Kpnl and ligated into the generic mammalian expression vecto 
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pRK50 (see below), opened with NotI and Sail, giving rise to the SSV expression 
vectors pCMV-SSV(wt) (SEQ ID NO:74) and pCMV-SSV(NNLS) (SEQ ID NO:75). 

Construction of the recombination test vector pRK64: first an attB site (Thorpe et 
5 al. Proc. Natl. Acad. Sci. USA, 95, 5505 - 5510 (1998)), generated by the 
annealing of the two synthetic oligonucleotides C31-4 (SEQ ID NO:l) and C31-5 
(SEQ ID N0:2), was ligated into the BstBI restriction site of the vector PSV-Paxl 

■A 

: (Buchholz et al.. Nucleic Acids Res., 24, 4256-4262 (1996)), 5:of its puromycin 

resistance gene and loxP site, giving rise to plasmid pRK52. The sequence and 
10 orientation of the cloned attB site was confirmed by DNA sequence analysis. 

Next, an attP site site (Thorpe et al. Proc. Natl. Acad. Sci. USA, 95, 5505 - 5510 
i (1998)), generated by the annealing of the two synthetic oligonucleotides C31-6 
i (SEQ ID NO:3) and C31-7-2 (SEQ ID NO:4), was ligated into the BamHI 

restriction site of plasmid pRK52, downstream of the puromycin resistance gene 
!15 and loxP site, giving rise to plasmid pRK64 (SEQ ID NO:5). The sequence and 

orientation of the attP site was confirmed by DNA sequence analysis. In pRK64 
; the newly cloned attB (position 348 - 431) and attP (position 1534 - 1617) sites 
I are in the same orientation flanking the puromycin resistance gene and the SV40 

i 

I early polyadenylation sequence. The attB and attP sites are followed by loxP sites 
20 in the same orientation (positions 435 - 469 and 1624 - 1658). The puromycin 
cassette is transcribed from the SV40 early enhancer/promoter region and 
followed by the coding region for E. coli 6-galactosidase and the SV40 late region 
polyadenylation sequence. 

25 Construction of C31-Int expression vectors: First the C31-Int gene of phage 
a>C31 was aniplified by PGR from phage DNA (DSM-49156, DSMZ-Deutsche 
Sammlung von Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg lb, 
D-38124 Braunschweig, Germany) using the primers C31-1 (SEQ ID NO:6) and 
C31-3 (SEQ ID NO:7). The ends of the PGR product were digested with NotI and 

30 the product was ligated into plasmid pBluescript II KS, opened with NotI, giving 
rise to plasmid pRK40. The DNA sequence of the 1.85 kb insert was determined 
and found to be identical to the published C31-Int gene (Kuhstoss et al., J. Mol. 
Biol. 222, 897-908 (1991)), except for an error in the stop codon. This error was 
repaired by PCR amplification of a 300 bp fragment from plasmid pRK40 using 

35 the primers C31-8 (SEQ ID N0:8) and C31-9 (SEQ ID NO:9), which provide a 




corrected Stop codon. The ends of this PGR fragment were digested with 
Eco47in and Xhol, the fragment was ligated into plasmid pRK40 and opened 
with Eco47III and Xhol to remove the fragment containing the defective stop 
codon. The resulting plasmid pRK55 contains the correct C31-Int gene as 
5 confirmed by DNA sequence analysis. 

The C31-Int gene was isolated from pRK55 as 1.85 kb fragment by digestion with 
NotI and Xhol and ligated into the generic mammalian expression vector pRKSO 
(see below), opened with NotI and Xhol, giving rise to the CSl-Int expression 
10 vector pCMV-C31-Int(wt). pCMV-C31-Int(wt) (SEQ ID NO: 10) contains a 700 bp 
I cytomegalovirus im mediated early gene promoter (position 1 - 700), a 270 bp 
hybrid intron (position 701 - 970), the C31-Int gene (position 978 - 2819), and 
a 189 bp synthetic polyadenylation sequence (position 2831 - 3020). 
' For the construction of pCMV-C31-Int(NNLS) a 1.5 kb fragment was amplified by 
15 PGR from phage DNA using oligonucleotides C31-2 (SEQ ID NO:98) and C31-3 
j (SEQ ID N0:7). The ends of the PGR product were digested with NotI and the 
product was ligated into plasmid pBluescript II KS, opened with NotI, giving rise 
I to plasmid pRK41 (SEQ ID NO: 99): A 1100 bp fragment generated by digestion 
; of plasmid pRK41 with Ncol and NotI was then ligated into plasmid pRK55 (SEQ 
20 ID NO: 80), opened with Ncol and NotI, giving rise to the plasmid pRK63 (SEQ ID 
NO:81). The G31-Int gene with N-terminal NLS was isolated as a 1.8 kb fragment 
from pRK63 by digestion with NotI and Xhol and ligated into the mammalian 
expression vector pRK50, opened with NotI and Xhol, giving rise to the G31-Int 
expression vector pCMV-G31-Int(NNLS). pCMV-G31-Int(NNLS) (SEQ ID NO:73) 
25 contains a 700 bp Gytomegalovirus immediated early gene promoter (position 1 
- 700), a 270 bp hybrid intron (position 701 - 970), the C31-Int gene with N- 
terminal NLS (position 976 - 2838), and a 189 bp synthetic polyadenylation 
sequence (position 2851 - 3040). 

For the construction of pGMV-G31-Int(GNLS), the 3' -end of the G31-Int gene 
30 was amplified from pCM\/-C31-Int(wt) as a 300 bp PGR fragment using the 
primers G31-8 (SEQ ID N0:8) and C31-2-2 (SEQ ID NO: 11). Primer G31-2-2 
modifies the 3' -end of the wildtype G31-Int gene such that the stop codon is 
replaced by a sequence of 21 basepairs coding for the SV40 T-antigen nuclear 
localisation sequence of 7 amino acids (Prolin-Lysin-Lysin-Lysin-Arginin-Lysin- 
35 Valin) (Kalderon et. al, Gell, 39, 499 - 509 (1984)), followed by a new stop 
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codon. The ends of this 300 bp PGR fragment were digested with with Eco47III 
and Xhol, the fragnnent was llgated into plasmid pCMV-C31-Int(wt) and opened 
with EC047III and Xhol to replace the 3 '-end of the wildtype C31-Int gene 
resulting in the plasmid pClv|V-C31-Int(CNLS). The identity of the new gene 
segment was verified by DNA sequence analysis. pCIMV-C31-Int(CNLS) (SEQ ID 
NO: 12) contains a 700 bp cytomegalovirus immediated early gene promoter 
(position 12 - 711), a 270 bp hybrid intron (position. 712 - 981), ;the modified 
'C31-Int gene (position 989 - 2851), and a 189 bp synthetic polyaderiylation 
sequence (position 2854 - 3043). 



To generate the Cre expression plasmid pCMV-Cre (SEQTD NO: 13); the coding 
■ sequence of Cre recombinase (Sternberg et al., J. Mol. Biol., 187, 197 - 212 
I (1986)) with a N-terminal fusion of the 7 amino acid SV40 T-antigen NLS (see 
I above) was recovered from plasmid pgk-Cre and cloned into the NotI and Xhol 
15 sites of plasmid pRK50. PRK50 (SEQ ID NO: 14) is a generic expression vector for 

mammalian cells based on the cloning vector pNEB193 (New England Biolabs Inc, 
I Beverly, MA, USA). PRK50 was built by insertion into pNEB193 of a 700 bp 

cytomegalovirus immediated early gene (CIMV-IE) promoter (position 1-700) 
, from plasmid pIREShyg (GenBank#U89672; Clontech Laboratories Inc, Palo Alto, 
20 CA, USA), a synthetic 270 bp hybrid intron (position 701-970), consisting of a 

I 

adenovirus derived splice donor and an IgG derived splice acceptor sequence 
(Ghoi et al., Mol. Gell. Biol.,r ll, 3070^ 3074 (1991)), and a 189 bp synthetic 
polyadenylation sequence (position 1000-1188) build from the polyadenylation 
consensus sequence and 4 MAZ polymerase pause sites (Levitt et: al., 
25 Genes&Dev., 3, 1019 - 1025 (1989); The EMBO J. 13, 5656 - 5667 (1994)). The 
• positive control plasmid rpRK64(ACre) if(SEQ ID;;' NO: 15);:lwas.: generated ^from 
pRK64 by trarisformatloriQnto the Cre expressing E. coli strain 294-Cre (BuchHbIz 
et al.. Nucleic Acids Res., 24/3118 - 3119 (1996)).' ' 

30 One of the transformed subclones was confirmed for the Cre mediated deletion of 
the loxP-flanked cassette by restriction mapping and further expanded. Plasmid 
pUG19 is a cloning vector without eukaryotic control elements used to equalise 
DNA amounts for transfections (GenBank#X02514; New England Biolabs Inc, 
Beverly, MA, USA). All plasmids were propagated in DH5a E. coli cells (Life 

35 Technologies GmbH, Karlsruhe, Germany) grown in Luria-Bertani medium and 
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purified with the plasmid DNA purification reagents "Plasmid-I^laxi-Kit" (Quiagen 
GmbH, Hilden, Germany) or "Concert high purity plasmid purification system" 
(Life Technologies GmbH, Karlsruhe, Germany). Following purification, the 
plasmid DNA concentrations were determined by absorption at 260 nm and 280 
5 nm in UVette cuvettes (Eppendorf-Netheler-Hinz GmbH, Hamburg, Germany) 
using a BioPhotometer (Eppendorf-Netheler-Hinz GmbH, Hamburg, Germany) 
and the plasmids were diluted to the same concentration; finally these were 
confirmed by separation of 10 ng of each plasmid on an ethidiumbromlde-stained 
agarose gel. 



B. Cell culture and transfections: Chinese hamster ovary (CHO) cells (Puck et a!., 
J. Exp. Med., 108, 945 (1958)) were obtained from the Institute for Genetics 
(University of Cologne, Germany) as a population adapted to growth in DMEM 
medium. The cells were grown in DMEM/Glutamax medium (Life Technologies) 
15 supplemented with 10% fetal calf serum at 37°C, 10% CO2 In humid atmosphere 
and passaged upon trypsinisation. One day before transfection 10® cells were 



plated into a 48-well plate (Falcon). For the transient transfection of cells with 

i 

plasmids each well received into 250 ml of medium a total amount of 300 ng 
I supercoiled plasmid DNA complexed before with the FuGeneS transfection 
' 20 reagent (Roche Diagnostics GmbH, Mannheim, Germany) according to the 
manufacturers protocol. Each 300 ng DNA preparation (Fig. 2 sample 4 to 11) 
contained 50 ng of the lucjferase expression vector pUHC13-l (Gossen et al., 
Proc Natl Acad Sci USA., 89 5547-5551 (1992)), 50 ng of the substrate vector 
pRK64, 0.5 ng or 1 ng of one of the recombinase expression vectors pCMV- 
25 C31Int(wt), pCMV-C31Int(NNLS), pCMV-C31Int(CNLS) or pCMV-Cre and 199 ng 
or 199.5 ng of pUC19 plasmid, except for the controls which received 50 ng of 
pUHC13-l together with 50 ng of pRK64 (sample 3) or pRK64(Acre) (sample 2) 
and 200 ng pUC19, or 50 ng pUHC13-l with 250 ng pUC19 (sample 1). 
Transfections of Ssv and XisA recombinases (Fig. 3) also contained 50 ng of the 
30 luciferase expression vector pUHC13-l, 50 ng of substrate vectors pPGKattA and 
pPGKnif and 10 ng or 20 ng of recombinase expression vector pCMV-SSV or 
pCMV-SSV(NNLS). or 25 ng or 100 ng of expression vectors pCMV-XisA/ pCMV- 
XisA(NNLS). Plasmid pUC19 was added to a total amount of 300 ng plasmid DNA. 
As the C31-Int expression vectors are 15% larger in size than pCMV-Cre and the 
35 same amounts of DNA of the three plasmids were used for transfection, the 
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samples with C31-Int vectors received 15% less plasmid molecules as compared 
to the samples with Cre expression vector. The 6-galactosidase values from C31- 
Int transfected samples by this value were not corrected and thus is a slight 
underestimation of the calculated C31-Int activities. For each sample to be tested 
5 four individual wells were transfected. One day after the addition of the DNA 
preparations each well received additional 250 ml of growth medium. The cells of 
each well were lysed 48 hours after transfection with 100 ml lysate reagent 
supplemented with protease inhibitors (Roche Diagnostics). The lysates were 
centrifuged and 20 ml were used to determine the B-galactosidase activities 
10 using the 6-galactosidase reporter gene assay (Roche Diagnostics) according to 
I the manufacturers protocol in a Lumat LB 9507 luminometer (Berthold). To 

I measure luciferase activity, 20ml lysate was diluted into 250ml assay buffer 

(50mM glycylglycin, 5mM MgCb, 5mM ATP) and the "Relative Light Units" (RLU) 
j were counted in a Lumat LB 9507 luminometer after addition of 100 ml of a 1 
•15 mM luciferin (Roche Diagnostics) solution. The mean value and standard 
deviation of the samples was calculated from the B-galactosidase and luciferase 

RLU values obtained from the four transfected wells of each sample. 

^ j ■ 

! C. Results: To set up an assay system for the measurement of C31-Int and Cre 
! , 20 recombinase efficiency in mammalian cells the recombination substrate vector 
pRK64 shown in Figure IE was first constructed. pRK64 contains a SV40 
promoter region for expression in mammalian cells followed by a 1.1 kb cassette 
which consists of the coding region of the puromycin resistance gene and a 
polyadenylation signal sequence. This cassette is flanked at the 5 '-end by the 84 

25 bp attB and at the 3 '-end by the 84 bp attP recognition site of C31-Int (Fig.l 
and 6). These attB and attP sites are located on the same DNA molecule and 
oriented in a way to each other which allows the deletion of the flanked DNA 
segment. The same orientation of attB and attP sites is used naturally by the 
^)C31 phage and the bacterial genome, leading to the integration of the phage 

30 genome when both sites are located on different DNA molecules (Thorpe et al., 
Proc. Natl. Acad. Sci. USA, 95, 5505 - 5510 (1998)). To measure C31-Int and 
Cre recombinase activities with the same substrate vector, pRK64 contains in 
addition two Cre recognition (loxP) sites in direct orientation next to the att sites. 
Since the att/lox-flanked cassette in plasmid pRK64 is inserted between the SV40 

35 promoter and the coding region of the B-galactosidase gene, its presence inhibits 
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28 

6-galactosidase expression as the SV40 promoter derived transcripts are 
terminated at the polyadenylation signal of the puromycin gene. Plasmid pRK64 
is turned into a 6-galactosidase expression vector upon C31-Int or Cre mediated 
deletion of the att/lox-flanl<ed puromycin cassette since the remaining single att 
and loxP site do not substantially interfere with gene expression. 

For the expression of recombinases a mammalian expression vector was 
designed which contains the CMV immediate early promoter followed by a hybrid 
intron, the coding region of the recombinase to be tested, and an artificial 
polyadenylation signal sequence. The backbone sequence of the four 
recombinase expression vectors shown in Figure lA-D is identical to each other 
except for the recombinase coding region. Plasmid pCMV-C31Int(wt) (Fig. lA) 
contains the nonmodified (wildtype) 1.85 kb coding region of C31-Int as found in 
the genome of phage OCSl (Kuhstoss, et al., J. I^iol. Biol. 222, 897-908 (1991)). 
Plasmid pCMV-C31Int(NNLS) (Fig. IB) contains a modified C31-Int gene coding 
for the full length C31-Int protein with a N-terminal extension of 7 amino acids 
derived from the SV40 virus large T antigen which serves as a nuclear 
localisation signal (NLS). Plasmid pCMV-C31Int(CNLS) (Fig. IG) contains a C- 
terminal extension of 7 amino acids derived from the SV40 virus large T antigen 
which serves as a nuclear localisation signal (NLS). Plasmid pCMV-Cre (Fig. ID) 
contains the 1.1 kb Cre coding region with an N-terminal fusion of the 7 amino 
acid NLS of the SV40 Trantigen. For Cre recombinase it has been shown that the 
N-terminal addition of the SV40 T-antigen NLS does not increase its 
recombination efficiency , in mammalian cells (Le et al.. Nucleic Acids Res., 27, 
4703 - 4709 (1999)). 
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As a test system to compare the efficiency of the 4 recombinases the same 
amount of plasmid DNA of each of the recombinase expression vectors together 
with a fixed amount of the reporter plasmid pRK64 was transiently introduced 
into Chinese Hamster Ovary (CHO) cells. Thus, in this assay design the efficiency 
of the various recombinases on an extrachromosomal substrate introduced into 
the CHO cells was compared as a circular plasmid. Two days afl:er transfection 
the cells from the various samples were lysed and the activity of B-galactosidase 
in the lysates was' determined by a specific chemiluminescense assay and 
expressed in "Relative Light Units" (RLU (B-Gal)) (Fig. 2). In addition all samples 



contained a fixed amount of a luciferase expression vector to control for- the 
experimental variation of cell transfection and lysis. For this purpose the lysates 
of each sample were also tested for luciferase activity with a specific 
chemiluminescense assay and the values expressed as "Relative Light Units" 
(RLU (Luciferase)) (Fig. 2). All transfection samples contained in addition varying 
amounts of the unrelated cloning plasmid pUC19 so that all samples were 
equalised to the same amount of plasmid DNA." As a positive control for 6- 
galactosldase a derivative of the recombination reporter pRK64 was used in 
which the loxP flanked 1,1 kb cassette has been removed through Cre mediated 
recombination in E. coll giving rise to plasmid pRK64(AGre). As negative controls 
served samples which received the unrecombined reporter plasmid pRK64 but no- 
recombinase expression vector as well as samples set up with the pUC19 plasmid 
alone. 

To determine the relative efficiency of the tested recombinases the RLU values of 
(3-gaiactosidase were divided individually for each sample by the RLU values 
obtained for luciferase and multiplied with 10^ From the values of the four data 
points of each sample the mean value and standard deviation was calculated as- 
an indicator of recombinase activity (Gal/Luc) (Fig. 2). The relative activity of the 
tested recombinases was then compared to the positive control defined as an 
activity of 1. 

As shown in Fig. 2, the expression of Cre recombinase (samples 10 and 11) 
resulted in a 150 to 170-fold increase of 6-galactosidase activity as compared to • 
the negative control (sample 3), demonstrating the wide dynamic range of our 
test system. Each recombinase vector was tested using two different amounts of . 
DNA for transfection (0.5 and Ing/sample), which in the case of Cre resulted^^^^^^^^^^^ 
63% and 72% recombinase activity (samples 10 and 11 as compared to the . 
positive control). These two values establish that the DNA amounts used are 
close to the test systems saturation for recombinase expression as the doubling 
of DNA amounts resulted only in a minor increase of recombinase activity. 

In comparison to Cre, the expression of wildtype C31-Int resulted in a 
considerably lower recombinase activity of 23% and 30% (Fig. 2, samples 4 and 
5) as compared to the positive control. These values represent 37% and 42% 
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recombinase activity for wildtype C31-Int as compared to Cre recombinase 
(compare samples 4 and 5 with 10 and 11). Upon tiie expression of C31-Int 
fused witii tlie N-terminal NLS (C31-Int(NNLS)) values of 32% and 36% 
recombinase activity (samples 6 and 7) were obtained as compared to the 
5 positive control. The C31-Int(NNLS) values represent 51% and 50% recombinase 
activity as compared to Cre (compare samples 6 and 7 to 10 and 11). Thus, the 
activity of C31-Int in mammalian cells is just moderately enhanced by the 
addition of a NLS signal. 

Surprisingly, upon the expression of C31-Int fused with the C-terminal NLS (C31- 
, 6 Int(CNLS)) values of 50% and 65% recombinase activity (samples 8 and 9) were 
i obtained as compared to the positive control. The C31-Int(CNLS) values 
j represent 79% and 90% recombinase activity as compared to Cre recombinase 

(compare samples 84 and 9 to 10 and 11). Unexpectedly,C31-Int(CNLS) exhibits 
a dramatic, more than twofold increase of recombinase activity in comparison to 
j 15 C31-Int(wt) (compare samples 8 and9 to 4 and 5). 

i 

I In order to test whether the addition of a NLS sequence may be a general, 

I 

■ simple method to enhance recombinase activity in mammalian cells we extended 
' our studies by two additional recombinases: XisA recombinase (XisA) derived 
20 from the cyanobacterium Anabaena, and SSV-Integrase (SSV-Int) derived from 
the SSVl virus of the thermophilic bacterium Sulfolobus shibatae. To this end we 
constructed mammalian expression vectors for the wildtype forms of XisA and 
SSV recombinases and compared their activity to versions which were modified 
by the N-terminal addition of the 7 amino acid NLS of the SV40 T-antigen. These 
25 recombinases were compared by the use of the reporter vector shown in Fig. IE, 
except that the att elements of C31-Int were replaced by the nif recognition , 
sequences for XisA or the att sequences for SSV-Int. As described above for C31- 
Int, recombinase activities were tested by transient transfection into CHO cells 
using the reporter vector derived p-galactosidase activity as readout and 
30 cotransfected luciferase as internal control. 

As shown in Fig. 3 for both, XisA and SSV recombinases the addition of a NLS 
sequence did not improve their activity In a mammalian cell line as compared to 
the wildtype forms. At both DNA concentrations tested wildtype XisA exhibits a 
significant recombination activity as compared to the reporter vector alone 
35 (compare samples 2 and 3 to sample 1), but this activity is not altered by the 



addition of an NLS (compare samples 2 and 3 to samples 4 and 5). SSV-Int 
exhibits only a low recombination activity (compare samples 7 and 8 with sample 
6) which is also not enhanced by the addition of a NLS (compare samples 9 and 
10 with samples 7 and 8). From these results we conclude that the addition of a 
NLS to an inefficient recombinase Is not a general, simple method to improve its 
performance in mammalian cells. 

Taken together, in the transient transfection test system shown In Figure 2 a 
more than twofold activity increase of the OC31 Integrase could be achieved by 
the C-terminal, but not the N-termlnal addition of the SV40 T antigen IMLS signal. 
As this signal sequence has been characterised to act as a nuclear localisation 
signal (Kalderon et. al, Cell, 39, 499 - 509 (1984)) we conclude that the 
efficiency Increase of C31-Int(CNLS) Is the result of the improved nuclear 
accumulation of this recombinase. The relative Inefficiency of C31-Int (NNLS) 
may be explained by the inaccessibility of the NLS peptide to the nuclear import 
machinery at the N-terminal position of the C31-Int protein. 
In particular, it could be shown that C31-Int(CNLS) recombines 
extrachromosomal DNA In mammalian cells almost as efficient as the widely used 
Cre recombinase and thus provides an additional or alternative recombination 
system of highest activity. The efficiency Increase of C31-Int(CNLS) as compared 
to its wildtype form Is regarded as an invention of substantial use for 
biotechnology. ' 

Example 2 

As demonstrated In example 1 C31-Int recombinase with the C-terminal fusion of 
the SV40 T-antigen NLS (C31-Int(CNLS)) shows In mammalian cells a 
recombination activity comparable to Cre recombinase on an extrachromosomal 
plasmid vector. It was further tried to test whether C31-Int(CNLS) exhibits a 
similar activity on a recombination substrate which is chromosomally integrated 
into the genome of mammalian cells. This question is critical for the use of a 
recombination system for genome engineering as it is possible that a 
recombinase may act efficiently on extrachromosomal substrates but is impaired 
if the recognition sites are part of the mammalian chromatin. To characterise the 
recombination activity of C31-Int(CNLS) and C31^Int(NNLS) on a chromosomal 
substrate the pRK64 report:er plasmid (Fig. IE) was stably integrated, containing 



a pair of loxP and att sites, into the genome of a mammalian ceil line. One of the 
stable transfected clones was chosen for further analysis and was transiently 
transfected with recombinase expression vectors coding for C31-Int(CNLS), C31- 
Int(NNLS), C31-Int(wt) or Cre recombinase. The activity of 6-galactosidase 
derived from the Cre expression vector recombined in these cells was taken as a 
measure of recombination efficiency. 

A. Plasmid constructions: all plasmids used and their purification are described in 
example 1. 

B. Cell culture and transfections: To generate a stably transfected C31-Int 
reporter cell line 2.5 x 10^ NIH-3T3 cells (Andersson et al., Cell, 16, 63-75 
(1979); DSMZ#ACC59; DSMZ-Deutsche Sammlung von Mikroorganismen und 
Zellkulturen GmbH, Mascheroder Weg lb, D-38124 Braunschweig, Germany) 
were electroporated with 5 [ig pRK64 plasmid DNA linearised with Seal and 
plated into 10cm petri dishes. The ceils were grown in DMEM/Glutamax medium 
(Life Technologies) supplemented with 10% fetal calf serum at 37°C, 10% CO2 in 
humid atmosphere, and passaged upon trypsinisation. Two days after tranfection 
the medium was supplemented with Img/ml of puromycin (Calbiochem) for the 
selection of stable integrants. Upon the growth of resistant colonies these were 
isolated under a stereomicroscope and individually expanded in the absence of 
puromycin. To demonstrate stable integration of the transfected vector, genomic 
DNA of puromycin resistant clones was prepared according to standard methods 
and 5-10 \jg were digested with EcoRV. Digested DNA was separated in a 0.8% 
agarose gel and transferred to nylon membranes (GeneScreen Plus, NEN 
DuPont) under alkaline conditions for 16 hours. The filter was dried and 
hybridised for 16 hours at 65°C with a probe representing the 5' part of the E. 
coli B-galactosidase gene (1.25 kb NotI - EcoRV fragment of plasmid CIMV-B-pA 
(R. Kuhn, unpublished). The probe was radiolabelled with P32-marked a-dCTP 
(Amersham) using the Megaprime Kit (Amersham). Hybridisation was performed 
in a buffer consisting of 10% dextranesulfate, 1% SDS, 50 mM Tris and 100 mM 
NaCI, pH7.5). After hybridisation the filter was washed with 2x SSC/1%SDS and 
exposed to BioMax MSI X-ray films (Kodak) at - 80°C. 

Transfections of the selected clone 3T3(pRK64)-3 with plasmid DNAs and the 
measurement of B-galactosldase activities in lysates were essentially performed 
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as described in example 1 for CHO cells, except that 32ng or 64ng of the 
recombinase expression plasmids and 218 or 186 ng of pUC19 plasmid were 
used and the pRK64 plasmid was omitted from all samples. 

C. Histochemical detection of B-aalactosidase activity in transfected 3T3r pRKF;4)- 
3 cells 

To directly demonstrate B-galactosidase expression in recombinase transfected 
cells, 10** 3T3(pRK64)-3 cells were plated one day before transfection into each 
well of a 48-well tissue culture plate (Falcon). For the transient transfection of 
cells with plasmids each well received into 250 pi of medium a total amount of 
150 ng supercoiled plasmid DNA complexed before with the FuGene6 transfection 
reagent (Roche Diagnostics GmbH, Mannheim, Germany) according to the 
manufacturers protocol. Each 150 ng DNA preparation contained 50 ng of the 
recombinase expression vector pCMV-Cre or pCMV-C31Int(CNLS) and lOOng of 
the pUC19 plasmid. After 2 days the culture medium was removed from the 
wells, the wells were washed once with phosphate buffered saline (PBS), and the 
cells were fixed for 5 minutes at room temperature in a solution of 2% 
formaldehyde and 1% glutaraldehyde in PBS. Next, the cells were washed twice 
with PBS and finally incubated in X-Gal staining solution for 24 hours at 37°C 
(staining solution: 5 mivi K3(Fe(CN)6), 5 mM K4(Fe(CN)6), 2 mM MgCb, Img/ml 
X-Gal (BioMol) in PBS) until photographs were taken. 

D. Results . i; 

To generate a mammalian cell clone with a stable genomic integration of the 
C31-Int and Cre recombinase reporter plasmid pRK64, the murine fibroblast cell 
line NIH-3T3 was electroporated with linearised pRK64 DNA (Fig.lD; see also 
example 1) and subjected to selection in puromycin containing growth medium. 
Plasmid pRK64 contains in between the pair of loxP and att sites the coding 
region of the puromycin resistance gene expressed from the SV40-IE promoter. 
Thirty-six puromycin resistant clones were isolated and the genomic DNA of 19 
clones was analysed for the presence and copy number of the pRK64 DNA, Three 
clones, which apparently contain 2 - 4 copies of pRK64, were further 
characterised on the single cell level for the expression of 6-galactosidase upon 
transient transfection with the Cre expression vector pCMV-Cre (Fig. IC). The cell 
clone with the largest proportion of B-galactosidase positive cells, 3T3(pRK64)-3, 
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was selected as most useful for the planned studies on C31-Int and Cre 
recombinase efficiency. 

To compare the efficiency of wildtype C31-Int (C31-Int(wt)), C31-Int(CNLS), 
5 C31-Int(NNLS), and Cre recombinases 32ng or 64 ng of the recombinase 
expression vectors pCMV-C31Int(wt), pCMV-C31Int(CNLS), pCMV-C31Int(NNLS), 
or pCMV-Cre (Fig. Iv^ A-D) together with the unrelated cloning plasmid pUC19 
were transiently introduced Into 3T3(pRK64)-3 cells, such that all samples 
contained the same amount of plasmid DNA. As a negative control a sample 
prepared with the pUC19 plasmid alone was used. Two days after transfection 
the cells from the various samples were lysed and the activity of B-galactosidase 
in the lysates was determined by a specific chemiluminescense assay and 
expressed in "Relative Light Units" (RLU)(B-Gal) (Fig, 4). From the values of the 
four data points of each sample the mean value and standard deviation was 
is calculated as an indicator of recombinase activity (Fig. 4). The relative activity of 
the tested recombinases was then compared to the highest value obtained with 

j the Cre expression vector, defined as an activity of 1. 

I 

As shown in Figure 4 the expression of Cre recombinase (samples 8 and 9) 
20 resulted in a 36 to 49-fold increase of B-galactosidase activity as compared to the 
negative control (sample 1), demonstrating the dynamic range of the test system 
used. Each recombinase vector was tested using two different amounts of DNA 
for transfection (32 ng and 64 ng/sample), which in the case of Cre resulted in 
73°/o and 100% recombinase activity (samples 8 and 9). These two values 
25 establish that the DNA amounts used are not far from the linear scale of the test 
systems ability for recombinase expression as the twofold increase of the amount 
of DNA also resulted in a significant increase of recombinase activity. 

The expression of wildtype C31-Int (Fig. 4, samples 2 and 3) resulted in a low 
30 recombinase activity of 4% and 10% as compared to thevalues obtained by Cre 
transfection (compare samples 2 and 3 with 8 and 9). This activity was only 
moderately enhanced by the expression of C31-Int(NNLS) to values of 19% and 
22% of Cre activity (compare samples 4 and 5 with samples 8 and 9). Upon the 
. expression of C31-Int(CNLS) values of 48% and 78% recombinase activity were 
35 obtained as compared to Cre recombinase (compare samples 6 and 7 to 8 and 
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9). Hence, C31-Int(CNLS) exhibits an 12-fold higher activity than C31-Int(wt) at 
32 ng plasmid DNA (Fig.4, compare samples 6 and 2) and an 8-fold higher 
activity than C31-Int(wt) at 64 ng plasmid DNA (Fig.4, compare samples 7 and 
3). 



In addition, it was aimed to directly demonstrate in situ the expression of B- 
galactosidase in 3T3(pRK64)-3 cells after transfectlon with Cre or C31-Int(CNLS) 
recombinase plasmid. Two days after transfectlon the cells were fixed in situ and 
incubated with the histochemical X-Gal assay which detects 6-gaIactosidase 
10 positive cells by a blue precipitate. As shown in Figure 5 stained cells were found 
at a comparable frequency in the samples transfected with the Cre or C31- 
Int(CNLS) expression vectors but not in the nontransfected control. This result 
confirms that the B-galactosidase activities measured by chemiluminescense 
upon recombinase transfectlon (Fig. 4) results from a population of individual, 
5 recombined reporter cells. 

In conclusion, upon the transient transfectlon of recombinase expression vectors 
into a ceil line with a genomic integration of the recombination substrate vector, 

' a 8 - 12-fold activity increase of the OC31 Integrase by the C-terminal fusion 

20 with the SV40 T-antigen NLS signal was found. As this signal sequence has been 
characterised to act as a nuclear localisation signal (Kalderon et. al. Cell, 39, 499 
- 509 (1984)), it v^as concluded that the dramatic efficiency increase of C31- 
Int(CNLS) is the result of the Improved nuclear accumulation of this 
recombinase. The approximately tenfold activity increase of C31-Int(CNLS) upon 

25 expression in a cell line with a genomic integration of the substrate vector is 
considerably higher than the activity increase found upon the transient 
expression of both vectors (see example 1). Thus, a substrate vector integrated 
into the chromatin of a mammalian cell may pose more stringent requirements 
on recombinase activity to be recombined as compared to an extrachromosomal 

30 substrate. 

The dramatic activity increase of C31-Int(CNLS), as compared to its wildtype 
form, on a stable integrated substrate in mammalian cells is an invention of 
significant practical use as this recombinase is as efficient as the widely used 
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Cre/loxP system; thus, C31^Int(CNLS) provides an additional or alternative 
recombination system of highest activity. 



Example 3 

To demonstrate that the increase in B-galactosidase activity obtained by the 
cotransfection of a C31-Int expression vector and the reporter vector pRK64 into 
mammalian cells is in fact the result of recombinase mediated deletion, one of 
the recombination products was detected by a specific polymerase chain reaction 
(PGR). The amplified PGR product was cloned and its sequence determined. The 
obtained sequence confirms that C31-Int mediated deletion of the test vector 
occurs in a mammalian cell line and that the recombination occurs at the l<nown 
breakpoint within the attB and attP sites. 

A. Plasmid constructions: The construction of plasmids pRK64, pCMV-Gre and 
pGI^V-G31-Int(wt) is described in Example 1. To simulate the recombination of 
pRK64 by C31-Int, the sequence between the CAA motives of the att sites 
(boxed in Fig. 5) was deleted from the computerfile of pRK64, giving rise to the 
sequence of pRK64(AInt) (SEQ ID NO: 16). 

B. Transfection of Cells and PGR amplification: |viEF5-5 mouse fibroblasts 
(Schwenk et al., 1998) (20000 cells per well of a 12 well plate (Falcon)) were 
transfected with 0.5 pg pRK64 alone or together with 250 ng pGMV-Int(wt) or 
pGMV-Gre using the FuGeneS transfection reagent following the manufacturers 
protocol (Roche Diagnostics). After 2 days DNA was extracted from these cells 
according to standard methods and used for PGR amplification with Primers P64- 
1 (SEQ ID N0:17; complementary to position 111-135 of pRK64(AInt)) and P64- 
4 (SEQ ID N0:18; complementary to position 740-714 of pRK64(AInt)) using the 
Expand High Fidelity PGR kit (Roche Diagnostics). PGR products were separated 
on a 0.8% agarose gel, extracted with the QuiaEx kit (Quiagen) and cloned into 
the pGR2.1 vector using the TA cloning kit (Invitrogen) resulting in plasmid 
pRKSOd. The sequence of its insert, seq80d (SEQ ID NO: 19), was determined 
using the reverse sequencing primer and standard sequencing methods (MWG 
Biotech). 

For the measurement of B-galactosidase activity the cells were lysed 2 days after 
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transfection and the B-galactosidase activities were determined with the 6- 
galactosidase reporter gene assay (Roche Diagnostics) as described in example 
1. 



5 C. Results: As a test vector for C31-Int mediated DNA recombination plasmid 
pRK64 was used, which contains the 1.1 kb coding region of the puromycin 
resistance gene flanlced 5' by the 84 bp attB' and 3' by the 84 bp attP 
recognition site of C31-Int (Fig. 5). These attB and attP sites are located on the 
same DNA molecule and oriented in a way to each other which allows the 
deletion of the att-flanked DNA segment. The same orientation of attB and attP 
i sites is used naturally by the OC31 phage and the bacterial genome for the 
I integration of the phage genome when both sites are located on different DNA 
molecules (Thorpe et al., Proc. Natl. Acad. Sci. USA, 95, 5505 - 5510 (1998)). As 
a positive control, vector pRK64 contains in addition two Cre recombinase 
5 recognition (loxP) sites in direct orientation next to the att sites. Since the att- 
flanked DNA segment in plasmid pRK64 is Inserted between a promoter active in 
mammalian cells and the 6-galactosidase gene, its deletion can be measured by 
the increase of 6-galactosidase activity. The expected product of C31-Int 
mediated deletion of plasmid pRK64 is shown in Fig. 6, designated as 
20 pRK64(AInt). If the recombination between attB and attP occurs as described In 
bacteria (Thorpe et al., Proc. Natl. Acad. Sci. USA, 95, 5505 - 5510 (1998)), a 
single attR, site Is generated and left on the parental plasmid (Fig. 6) while the 
" flanked DNA is excised and contains an attL site. Beside the measurement of 6-' 
galactosidase activity, C31-Int mediated recombination of pRK64 can be directly 
25 detected on the DNA level by a specific polymerase chain reaction (PGR) using 
the primers P64-1 and P64-4 (Fig. 6). These primers, located 5 ' of the attB site 
(P64-1) and 3 'of the attP site, are designed to amplify a PGR product of 630 bp 
lenght upon the C31-Int mediated recombination of pRK64. For the expression of 
C31-Int in mammalian cells plasmid pGMV-G31(wt) was used, which contains the 
30 CMV-IE-Promoter upstream of the G31-Int coding region followed by a synthetic 
polyadenylation signal (see Example 1 and Fig.l). 

The recombination substrate vector pRK64 was transiently transfected Into the 
murine fibroblast cell line MEF5-5 either alone, or together with the G31-Int 
expression vector pC|ViV-G31(wt), or together with an expression vector for Cre 
recombinase, pCMV-Gre. Two days after transfection half the cells of each sample 
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was lysed and used to measure 6-galactosidase activity by chemiluminescense, 
and the other half was used for the preparation of DNA from the transfected cells 
for PCR analysis. The B-galactosidase levels of the 3 samples were found as 
following (expressed as "Relative Light Units" (RLU) with standard deviation (SD) 
5 of the B-galactosidase assay): 

Sample RLU (SD) 

1) pRK64 692 +5 

2) pRK64 + pCMV-Cre 8527 ±269 
10 3) pRK64 + pCMV-C31(wt) 1288 + 93 

; As the coexpression of the test vector pRK64 together with the C31-Int 
expression vector In sample 3 leads to a significant increase of B-galactosidase 
activity as compared to pRK64 alone, this result suggests that pRK64 is 
recombined by C31-Int as anticipated in Fig. 6, 

Next, cellular DNA was prepared from the three samples and tested for the 
; occurrence of the expected Cre or C31-Int generated deletion product by PCR 
\ using primers P64-1 and P64-4 for amplification. As shown in Fig. 7 an 
zO amplification product of the expected size was found only in the samples 
cotransfected with the Cre or C31-Int recombinase expression vectors (Fig. 7A, 
Iane3 and lane 4). The PCR products amplified from pRK64frecombined by C31- 
Int or Cre are of the same size but should be recombined via the attB/P or loxP 
sites, respectively. 

25 To prove that the PCR product found after cotransfection of plasmids pRK64 and 
pCMV-C31(wt) represents in fact the deletion product of C31-Int mediated 
recombination, this DNA fragment was cloned into a plasmid vector and its DNA 
sequence determined. One clone, pRKSOd, was analysed, and its sequence 
showed exactly the sequence of an attR site as expected from C31-Int mediated 

30 deletion of ,pRK64 (Fig. 7B, compare to Fig. 6). 

In conclusion, this experiment demonstrates that C31-Int mediated deletion of a 
vector containing a pair of attB/attP sites occurs in a mammalian cell line, and 
that the recombination occurs within the same 3 bp breakpoint region of attB and 
35 attP as found in bacteria (Thorpe et al., Proc. Natl. Acad. Sci. USA, 95, 5505 - 




5510 (1998)). Thus, it was concluded that an increase of 6-galactosidase activity 
observed by cotransfection of the pRK64 reporter vector and a C31-Int 
expression vector in mammalian cells truly reflects C31-Int recombinase activity. 

5 

Example 4 

As has been demonstrated in examples 1-3, the C31-Int recombinase with the C- 
terminal fusion of the SV40 T-antigen NLS (C31-Int(CNLS)) shows a 
iQ recombination activity comparable to Cre recombinase on an extrachromosomal 

i as well as a chromosomally integrated target in mammalian cells in vitro. To test 
whether C31-Int(CNLS) exhibits activity in mice, transgenic mice carrying a C31- 
Int(CNLS) expression vector were generated. These transgenic mice were 
crossed with reporter mice carrying the recombinase substrate. Recombination- 

5 mediated expression of p-galactosidase, which can be measured by staining with 
the substrate X-Gal, was analyzed in testes of double transgenic progeny 
carrying both the recombinase and the reporter. 

A. Plasmid constructions: For the construction of the C31-Int(CNLS) transgene 
20 expression vector, the C31Int gene with C-terminal NLS was isolated as a 2 kb- 

fragment generated by restriction of pCMV-C31Int(CNLS) (SEQ ID IMO: 12) with 
Bglll. The fragment was ligated into the Bglll restriction site of the vector 
pCAGGS-Cre-pA (SEQ ID NO: 104) giving rise to the plasmid pCAGGS-C31CNLS- 
pA (SEQ ID NO: 105). In pCAGGS-C31CNLS-pA the C31-Int(CNLS) (position 
25 1891-3753) is transcribed from the CAGGS promoter (position 1-1616) and 
followed by the SV40 late region polyadenylation sequence (position 3763-3941). 

B. Production of transgenic mice: For the embryo injections a 3.95 kb-fragment 
was generated by restriction of the plasmid pCAGGS-C31CNLS-pA with PstI and 

30 Ascl. This fragment was purified as follows: DNA bands were separated on an 
agarose-gel without ethidiumbromide. One part of the gel was stained with 
ethidiumbromide to locate the band to excise. The DNA was electroeluted from 
the excised band with S&S Biotrap Elution Chamber in Ix TAE (40 mM Tris- 
acetate, 1 mM EDTA) overnight. The DNA was precipitated from the eluate with 

35 1/10 volume 3M sodium acetate and 2.5 volumes ethanol at -20 °C for several 
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hours. The DNA was pelleted by centrifugation at 13000 rpm for 30 min and 
washed twice with 70 % ethanol. The dried DNA pellet was resuspended in TE 
(10 mM Tris, 1 mM EDTA, pH 8). Subsequently the precipitation procedure was 
repeated once and the DNA resuspended in injection buffer (10 mM Tris pH 7.2, 

5 0.1 mM EDTA ). The sample was dialysed with Slide-A-Lyse Mini Dialysis Unit 
(Pierce) in Injection buffer with several changes of buffer at 4°C overnight. 
Different amounts of the sample were checked on a gel to determine 
concentration. To generate transgenic mice, 5-10 fg of the -purified fragment 
were injected Into one pronucleus of (B6CBA)F2 mouse one-cell embryos. The 

D injected embryos were subsequently transferred into the oviduct of 0.5 day 
pseudopregnant NMRI females. 



C. Analysis of transgenic mice: Mice were analyzed for the presence of the 
pCAGGS-C31CNLS-pA transgene by PGR using tail DNA and the primers C31- 

5 screen 1 (SEQ ID NO: 100) and C31-screen 2 (SEQ ID NO: 101) amplifying a 
fragment of 500 bp. The PGR reaction contained 5 pi PGR buffer (Invitrogen), 2 
pi 50 mM MgCb , 1.5 pi 10 mM dNTP-mix, 2 pi (10 pmol) of each primer, 0.5 pi 
Taq-polymerase (5 U/ pi) and water to a volume of 50 pi. The program used for 
the PGR reactions was: 94 °G for 30 s, 55 °G for 30 s and 72 °C for 1 min in 30 
20 cycles. 

D. Analysis of C31-Int(GNLS) activity: Founder mice transgenic for the pGAGGS- 
G31GNLS-pA transgene were crossed to heterozygous G31 reporter- mice carrying 
the C31 reporter construct in the ROSA26 locus (SEQ ID NO: 106) (Fig. 8). 



25 Offspring of the crosses were genotyped for the presence of the pCAGGS- 
C31CNLS-pA transgene by the PGR assay described in section G as well as for the 
ROSA26-G31 reporter allele by a LacZ-specific PGR assay. The PGR was 
performed using tail DNA and the primers p-Gal 3 (SEQ ID N0:102) and p-Gal 4 
(SEQ ID NO: 103) amplifying a fragment of 315 bp. The PGR reaction contained 5 

30 pi PGR buffer (Invitrogen), 2.5 pi 50 mM MgCh , 2 p! 10 mM dNTP-mix, 1 pi (10 
pmol) of each primer, 0.4 pi Taq-polymerase (5 U/ pi) and water to a volume of 
50 pi. The program used for the PGR reactions was: 94 °C for 1 min, 60 °C for 1 
min and 72 °C for 1 min in 30 cycles. 
. Testes from mice carrying the pGAGGS-G31GNLS-pA transgene as well as the 

35 reporter locus and from a control mouse carrying the reporter allele only were 



dissected. The tissues were imbedded in OCT Tissue freezing medium 
(Leica/Jung) and frozen in liquid nitrogen. Cryosections were generated from the 
embedded tissues using a Leica CM3050 cryomicrotome, dried on polylysine- 
coated slides for 1-4 hours and then stained as follows: Sections were fixed in 
0.2 % glutaraldehyde, 5 mM EGTA, 2 mM MgOz in 0.1 1^ PB (K2HPO4/ KH2PO4, pH 
7.3) for 5 min at room temperature and washed in wash buffer (2 mM MgCb, 
0.02 % Nonidet-40 in PB in 0.1 M PB) 3 times for 15 min. Then sections were 
stained in X-Gal-solution (0.6 mg/ ml X-Gal in DMSO, 5 mM potassium 
hexacyanoferrat III, 5 mM potassium hexacyanoferrat II in LacZ wash buffer) 
overnigth at 37 °C. After staining sections were washed in Ix PBS twice for 5 
min. Dehydration was performed by washing the sections first with 70 %, 96 % 
and 100 % ethanol for 2 min each, then with a 1:1 mix of ethanol and xylol for 5 
min and in the end only with xylol for 5 min. Before taking pictures sections were 
mounted in Entellan. 

E. Results: To identify transgenic founder mice carrying the pCAGGS-C31CNLS- 
pA transgene, 29 mice born from the injection experiment were analyzed for the 
presence of the transgene. 5 founder mice (3 females and 2 males) were 
identified. To analyze the activity of the C31-Int(CNLS) recombinase in 
transgenic mice, 2 of the female founder mice were crossed to heterozygous C31 
reporter mice carrying a C31 reporter construct in the ROSA26 locus (Fig. 8). 
From each of these crosses, one offspring carrying the pCAGGS-C31CNLS-pA 
transgene as well as the C31 reporter allele was sacrificed. In oder to determine 
whether pCAGGS-C31CNLS-pA transgenic mice are able to delete an attB/P 
flanked DNA sequence in the mouse germline, tissue sections from the testes of 
the sacrificed animals were prepared and stained for 6-galactosidase activity with 
X-Gal. Fig. 9 shows the result of the staining experiment for one of these mice 
(A) as well as a control mouse carrying only the reporter allele, but lacking the 
pCAGGS-C31CNLS-pA transgene (B). Clear staining can be detected in the 
maturing sperm cells in about 50% of the tubules with the proportion of 6- 
galactosidase expressing cells ranging between 10 and 100. No staining could be 
detected for the control mouse. This dearly demonstrates that C31-int-mediated 
recombination has taken place during spermatogenesis in the pCAGGS-C31CNLS- 
pA transgenic mice. These results show that the C31-int is functional in vivo, in a 




transgenic mouse system and therefore provides a new tool to introduce specific 
deletions, inversions or integrations into the mouse germline. 



